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SURVEILLANCE SYSTEM AND METHOD 
HAVING AN ADAPTIVE SEQUENTIAL 
PROBABILITY FAULT DETECTION TEST 

CROSS-REFERENCE TO RELATED 
APPLICATIONS 

This application is a continuation of continuation patent 
application Ser. No. 1 1/027,998, filed Dec. 31, 2004 now U.S. 
Pat. No. 7,082,379 which claims the benefit and is a continu- 
ation of patent application Ser. No. 10/095,835, filed Mar. 8, 
2002, now U.S. Pat. No. 6,892,163, the entire disclosure of 
both incorporated herein by reference. 

STATEMENT REGARDING FEDERALLY 
SPONSORED RESEARCH AND DEVELOPMENT 

The invention described herein was made in the perfor- 
mance of work under NASA Small Business Technology 
Transfer Research (STTR) Contract NAS8-98027, and is sub- 
ject to the provisions of Public Law 96-5 1 7 (35 USC 202) and 
the Code of Federal Regulations 48 CFR 52.227-1 1 as modi- 
fied by 48 CFR 1852.227-11, in which the contractor has 
elected to retain title. 

The United States Government has rights in this invention 
pursuant to Contract No. W-31-109-ENG-38 between the 
United States Government and the University of Chicago 
representing Argonne National Laboratory. 

FIELD OF THE INVENTION 

The instant invention relates generally to a system and 
method for process fault detection using a statistically based 
decision test and, in particular, to a system and method for 
performing high sensitivity surveillance of an asset such as a 
process and/or apparatus wherein the surveillance is per- 
formed using an adaptive sequential probability (ASP) fault 
detection test comprised of a probability density function 
model empirically derived from a numerical analysis of asset 
operating data. 

BACKGROUND OF THE INVENTION 

Conventional process surveillance schemes are sensitive 
only to gross changes in the mean value of a process signal or 
to large steps or spikes that exceed some threshold limit value. 
These conventional methods suffer from either a large num- 
ber of false alarms (if thresholds are set too close to normal 
operating levels) or from a large number of missed (or 
delayed) alarms (if the thresholds are set too expansively). 
Moreover, most conventional methods cannot perceive the 
onset of a process disturbance or sensor signal error that gives 
rise to a signal below the threshold level or an alarm condi- 
tion. Most conventional methods also do not account for the 
relationship between measurements made by one sensor rela- 
tive to another redundant sensor or between measurements 
made by one sensor relative to predicted values for the sensor. 

Recently, improved methods for process surveillance have 
developed from the application of certain aspects of artificial 
intelligence technology. Specifically, parameter estimation 
methods have been developed using either statistical, math- 
ematical or neural network techniques to learn a model of the 
normal patterns present in a system of process signals. After 
learning these patterns, the learned model is used as a param- 
eter estimator to create one or more predicted (virtual) signals 
given a new observation of the actual process signals. Further, 
high sensitivity surveillance methods have been developed 
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for detecting process and signal faults by analysis of a math- 
ematical comparison between an actual process signal and its 
virtual signal counterpart. In particular, such a mathematical 
comparison is most often performed on a residual error signal 
5 computed as, for example, the difference between an actual 
process signal and its virtual signal counterpart. 

Parameter estimation based surveillance schemes have 
been shown to provide improved surveillance relative to con- 
ventional schemes for a wide variety of assets including 
to industrial, utility, business, medical, transportation, financial, 
and biological systems. However, parameter estimation 
based surveillance schemes have in general shown limited 
success when applied to complex processes. Applicants rec- 
ognize and believe that this is because the parameter estima- 
15 tion model for a complex process will, in general, produce 
residual error signals having a non-Gaussian probability den- 
sity function. Moreover, a review of the known prior-art dis- 
closes that virtually all such surveillance systems developed 
to date utilize or assume a Gaussian model of the residual 
20 error signal probability density function for fault detection. 
Hence, a significant shortcoming of the known prior-art is 
that, inter alia, parameter estimation based surveillance 
schemes will produce numerous false alarms due to the mod- 
eling error introduced by the assumption of a Gaussian 
25 residual error signal probability density function. The impli- 
cation for parameter estimation based surveillance schemes is 
that the fault detection sensitivity must be significantly 
reduced to prevent false alanns thereby limiting the utility of 
the method for process surveillance. An alternative for statis- 
30 tically derived fault detection models is to mathematically 
pre-process the residual error signals to remove non-Gauss- 
ian elements prior to using the residual error signals in the 
fault detection model; however this approach requires an 
excess of additional processing and also limits the sensitivity 
35 of the surveillance method. Therefore, the implication of 
assuming a Gaussian residual error signal probability density 
function for a parameter estimation based surveillance 
scheme is simply that the system becomes less accurate 
thereby degrading the sensitivity and utility of the surveil- 
40 lance method. 

Many attempts to apply statistical fault detection tech- 
niques to surveillance of assets such as industrial, utility, 
business, medical, transportation, financial, and biological 
processes have met with poor results in part because the fault 
45 detection models used or assumed a Gaussian residual error 
signal probability density function. 

In one specific example, a multivariate state estimation 
technique based surveillance system for the Space Shuttle 
Main Engine’s telemetry data was found to produce numer- 
50 ous false alarms when a Gaussian residual error fault detec- 
tion model was used for surveillance. In this case, the surveil- 
lance system’s fault detection threshold parameters were 
desensitized to reduce the false alarm rate; however, the 
missed alarm rate then became too high for practical use in the 
55 telemetry data monitoring application. 

Moreover, current fault detection techniques for surveil- 
lance of assets such as industrial, utility, business, medical, 
transportation, financial, and biological processes either fail 
to recognize the surveillance performance limitations that 
60 occur when a Gaussian residual error model is used or, rec- 
ognizing such limitations, attempt to artificially conform the 
observed residual error data to fit the Gaussian model. This 
may be attributed, in part, to the relative immaturity of the 
field of artificial intelligence and computer-assisted surveil- 
65 lance with regard to real-world process control applications. 
Additionally, a general failure to recognize the specific limi- 
tations that a Gaussian residual error model imposes on fault 
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decision accuracy for computer-assisted surveillance is punc- 
tuated by an apparent lack of known prior art teachings that 
address potential methods to overcome this limitation. In 
general, the known prior-art teaches computer-assisted sur- 
veillance solutions that either ignore the limitations of the 
Gaussian model for reasons of mathematical convenience or 
attempt to conform the actual residual error data to the arti- 
ficial Gaussian model, for example, by using frequency 
domain filtering and signal whitening techniques. 

For the foregoing reasons, there is a need for a surveillance 
system and method that overcomes the significant shortcom- 
ings of the known prior-art as delineated hereinabove. 

BRIEF SUMMARY OF THE INVENTION 

The instant invention is distinguished over the known prior 
art in a multiplicity of ways. For one tiling, an embodiment of 
the instant invention provides a surveillance system and 
method having a fault detection model of unconstrained prob- 
ability density function form and having a procedure suitable 
for overcoming a performance limiting trade-off between 
probability density function modeling complexity and deci- 
sion accuracy that has been unrecognized by the known prior- 
art. Specifically, an embodiment of the instant invention can 
employ any one of a plurality of residual error probability 
density function model forms, including but not limited to a 
Gaussian form, thereby allowing a surveillance system to 
utilize the model form best suited for optimizing surveillance 
system performance. 

Moreover, an embodiment of the instant invention provides 
a surveillance system and method that uses a computer-as- 
sisted learning procedure to automatically derive the most 
suitable form of the residual error probability density func- 
tion model by observation and analysis of a time sequence of 
process signal data and by a combination of a plurality of 
techniques. This ability enables surveillance to be performed 
by the instant invention with lower false alarm rates and lower 
missed alarm rates than can be achieved by the known prior- 
art systems and methods. 

Thus, an embodiment of the instant invention provides a 
surveillance system and method that performs its intended 
function much more effectively by enabling higher decision 
accuracy. Further, an embodiment of the instant invention is 
suitable for use with a plurality of parameter estimation meth- 
ods thereby providing a capability to improve the decision 
performance of a wide variety of surveillance systems. 

In one embodiment, the instant invention provides a sur- 
veillance system and method that creates and uses, for the 
purpose of process surveillance, a multivariate state estima- 
tion technique parameter estimation method in combination 
with a statistical hypothesis test fault detection method hav- 
ing a probability density function model empirically derived 
from a numerical analysis of asset operating data. 

Particularly, and in one embodiment, the instant invention 
provides a surveillance system and method for providing 
surveillance of an asset such as a process and/or apparatus by 
providing training and surveillance procedures. 

In accordance with one embodiment, the instant invention 
provides a training procedure comprised of a calibrate param- 
eter estimator procedure and a calibrate fault detector proce- 
dure. 

The calibrate parameter estimator procedure creates a 
parameter estimation model(s) and trains a parameter estima- 
tion model by utilizing training data correlative to expected 
asset operation and, for example, utilizing a multivariate state 
estimation technique (MSET) procedure. The calibrate 
parameter estimator procedure further stores this model as a 
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new parameter estimation model in a process model. The 
process model may contain one or more parameter estimation 
models depending upon design requirements. Additionally, 
each parameter estimation model contained in the process 
5 model may be created to implement any one of a plurality of 
parameter estimation techniques. 

The calibrate fault detector procedure makes use of the 
parameter estimation models to provide estimated values for 
at least one signal parameter contained in the training data, 
to Generally, the calibrate fault detector procedure will create a 
separate and distinct fault detection model for each sensor or 
data signal associated with the asset being monitored for the 
presence of fault conditions during the surveillance proce- 
dure. 

15 The calibrate fault detector procedure includes, for 
example, a method of fitting a standard Gaussian probability 
density function (PDF) to a training data residual distribution 
(computed as a function of estimated process parameters and 
the training data) and then adding successive higher order 
20 terms of a remainder function to the standard Gaussian PDF 
for the purpose of defining a general PDF that better fits the 
computed training data residual distribution. Other tech- 
niques for fitting a general PDF to the training data are simi- 
larly feasible and useful in accordance with the instant inven- 
25 tion and include, for example, a technique for fitting a 
polynomial function to the training data. 

The training procedure is completed when all training data 
has been used to calibrate the process model. At this point, the 
process model preferably includes parameter estimation 
30 models and fault detection models for each sensor or data 
signal associated with the asset being monitored for the pres- 
ence of fault conditions during the surveillance procedure. 
The process model is thereafter used for performing surveil- 
lance of the asset. 

35 The surveillance procedure is performed using an adaptive 
sequential probability (ASP) fault detection test comprised of 
the general probability density function model empirically 
derived from a numerical analysis of the asset training data. 

The surveillance procedure acquires and digitizes current 
40 asset data and then estimates process parameters as a function 
of the acquired digitized current asset data and the parameter 
estimation model(s) obtained from the calibrate parameter 
estimator procedure. Then, fault detection is determined by 
first computing data residuals as a function of the estimated 
45 process parameters and the acquired digitized current asset 
data and then performing the ASP test(s) as a function of the 
fault detection models and thus, as a function of the fitted 
general PDF obtained in the calibrate fault detector proce- 
dure. Each ASP test returns one of three possible states: a not 
50 null state which rejects the probability that a null hypothesis 
is true and excepts an alternative hypothesis correlative to 
unexpected operation of the asset; a null state which accepts 
the probability that a null hypothesis is true and excepts the 
null hypothesis correlative to expected operation of the asset; 
55 and an in-between state which excepts neither the null 
hypothesis nor the alternative hypothesis as being true and 
requires more data to reach a conclusion. 

The results of the fault detection procedure are then ana- 
lyzed according to the instant invention such that an alarm 
60 and/or a control action are taken when the analysis determines 
that the results indicate unexpected operation of the asset. 

Hence, the instant invention is distinguished over the 
known prior-art by providing a surveillance system and 
method for performing surveillance of an asset by acquiring 
65 residual data correlative to expected asset operation; fitting a 
mathematical model to the acquired residual data; storing the 
mathematical model in a memory means; collecting a current 
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set of observed signal data values from the asset; using the 
mathematical mode in a sequential hypotheses test to deter- 
mine if the current set of observed signal data values indicate 
unexpected operation of the asset indicative a fault condition; 
and outputting a signal correlative to each detected fault con- 
dition for providing asset surveillance. 

Accordingly, the instant invention provides a new, novel 
and useful surveillance system and method having an adap- 
tive sequential probability fault detection test. 

In one embodiment of the instant invention the surveillance 
system and method includes an unconstrained fonn of a 
residual error probability density function model used in said 
surveillance system’s fault detection method. 

In one embodiment of the instant invention the surveillance 
system and method can perform high sensitivity surveillance 
for a wide variety of assets including industrial, utility, busi- 
ness, medical, transportation, financial, and biological pro- 
cesses and apparatuses wherein such process and/or appara- 
tus asset preferably has at least one pair of redundant actual 
and/or virtual signals. 

In one embodiment of the instant invention the surveillance 
system and method includes a statistical hypothesis test sur- 
veillance decision procedure that uses a fault detection model 
comprised of a probability density function model of a 
residual error signal that is of an unconstrained form. 

In one embodiment of the instant invention the surveillance 
system and method creates an improved fault detection model 
for a process surveillance scheme using recorded operating 
data for an asset to train a fault detection model. 

In one embodiment of the instant invention the surveillance 
system and method provides an improved system and method 
for surveillance of on-line, real-time signals, or off-line accu- 
mulated signal data. 

In one embodiment of the instant invention the surveillance 
system and method provides an improved system and method 
for surveillance of signal sources and detecting a fault or error 
state of the signal sources enabling responsive action thereto. 

In one embodiment of the instant invention the surveillance 
system and method provides an improved system and method 
for surveillance of signal sources and detecting a fault or error 
state of the asset processes and apparatuses enabling respon- 
sive action thereto. 

In one embodiment of the instant invention the surveillance 
system and method provides an improved decision as to the 
accuracy or validity for at least one process signal parameter 
given an observation of at least one actual signal from the 
asset. 

In one embodiment of the instant invention the surveillance 
system and method provides an improved system and method 
for ultra-sensitive detection of a fault or error state of signal 
sources and/or asset processes and apparatuses wherein the 
parameter estimation technique used for the generation of at 
least one virtual signal parameter is a multivariate state esti- 
mation technique (MSET) having any one of a plurality of 
pattern recognition matrix operators, training procedures, and 
operating procedures. 

In one embodiment of the instant invention the surveillance 
system and method provides an improved system and method 
for ultra-sensitive detection of a fault or error state of signal 
sources and/or asset processes and apparatuses wherein the 
parameter estimation technique used for the generation of at 
least one virtual signal parameter is a neural network having 
any one of a plurality of structures, training procedures, and 
operating procedures. 

In one embodiment of the instant invention the surveillance 
system and method provides an improved system and method 
for ultra- sensitive detection of a fault or error state of signal 
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sources and/or asset processes and apparatuses wherein the 
parameter estimation technique used for the generation of at 
least one virtual signal parameter is a mathematical process 
model having any one of a plurality of structures, training 
5 procedures, and operating procedures. 

In one embodiment of the instant invention the surveillance 
system and method provides an improved system and method 
for ultra-sensitive detection of a fault or error state of signal 
sources and/or asset processes and apparatuses wherein the 
1° parameter estimation technique used for the generation of at 
least one virtual signal parameter is an autoregressive moving 
average (ARMA) model having any one of a plurality of 
structures, training procedures, and operating procedures. 

In one embodiment of the instant invention the surveillance 
1 5 system and method provides an improved system and method 
for ultra-sensitive detection of a fault or error state of signal 
sources and/or asset processes and apparatuses wherein the 
parameter estimation technique used for the generation of at 
least one virtual signal parameter is a Kalman filter model 
20 having any one of a plurality of structures, training proce- 
dures, and operating procedures. 

In one embodiment of the instant invention the surveillance 
system and method provides a novel system and method for 
using at least one of a plurality of methods to classify the state 
of a residual error signal produced by the mathematical dif- 
ference between two signals, said two signals being either 
actual and/or predicted signals, into one of at least two cat- 
egories. 

, ( . In one embodiment of the instant invention the surveillance 

system and method provides a novel system and method to 
classify the state of a residual error signal wherein said clas- 
sification is made to distinguish between a normal signal and 
a abnormal signal. 

35 In one embodiment of the instant invention the surveillance 

system and method provides a novel system and method to 
classify the state of a residual error signal wherein said clas- 
sification is performed using a statistical hypothesis test hav- 
ing any one of a plurality of probability density function 
40 models, training procedures, and operating procedures. 

In one embodiment of the instant invention the surveillance 
system and method provides a novel system and method to 
classify the state of a residual error signal wherein said clas- 
sification is performed using a probability density function 
45 model having any one of a plurality of structures, training 
procedures, and operating procedures. 

Accordingly, having thus summarized the invention, it 
should be apparent that numerous modifications and adapta- 
tions may be resorted to without departing from the scope and 
50 fair meaning of the present invention as set forth hereinbelow 
by the claims. 

BRIEF DESCRIPTION OF THE DRAWINGS 

55 FIG. 1 is a block diagram of a surveillance system of an 
embodiment in accordance with the instant invention. 

FIG. 2 is a schematic functional flow diagram of an 
embodiment in accordance with the instant invention. 

60 FIG. 3 is a schematic functional flow diagram of a method 
and system for training a process model consisting of at least 
one parameter estimation model and at least one fault detec- 
tion model using recorded observations of the actual process 
signals in accordance with the instant invention. 

65 FIG. 4 is a schematic functional flow diagram of a method 
and system for the fault detection model training procedure in 
accordance with the instant invention. 
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FIG. 5 is a schematic functional flow diagram of a preferred 
method and system for performing surveillance of an asset 
using at least one parameter estimation model and at least one 
fault detection model in accordance with the instant inven- 
tion. 

FIG. 6 is a schematic functional flow diagram of a method 
and system for the fault detection surveillance procedure in 
accordance with the instant invention. 

FIG. 7 is a schematic functional flow diagram of a method 
and system for performing parameter estimation and fault 
detection using a redundant sensor. 

FIG. 8 is a schematic functional flow diagram of a method 
and system for performing parameter estimation and fault 
detection using a generalized parameter estimation model, 
such as a multivariate state estimation model, a neural net- 
work model, an analytical model, or a Kalman filter model. 

FIG. 9 is a flow diagram of a training and surveillance 
procedure in accordance with the instant invention. 

FIG. 10 illustrates characteristics of the null and alternate 
hypotheses for the prior-art sequential probability ratio test 
(SPRT) mean tests. 

FIG. 11 illustrates characteristics of the null and alternate 
hypotheses for the prior-art SPRT variance tests. 

FIG. 12 illustrates acquired operating data, estimated 
parameter data, and residual error data for a typical Space 
Shuttle Main Engine accelerometer. 

FIG. 13 illustrates a probability density function of the 
residual error data for a typical Space Shuttle Main Engine 
accelerometer with comparison to a Gaussian probability 
density function. 

FIG. 14 illustrates an un-optimized one-term expansion 
probability density function model of the residual error data 
for a typical Space Shuttle Main Engine accelerometer with 
comparison to the actual residual error data and a Gaussian 
probability density function. 

FIG. 15 illustrates an un-optimized two-term expansion 
probability density function model of the residual error data 
for a typical Space Shuttle Main Engine accelerometer with 
comparison to the actual residual error data and a Gaussian 
probability density function. 

FIG. 16 illustrates an un-optimized three-term expansion 
probability density function model of the residual error data 
for a typical Space Shuttle Main Engine accelerometer with 
comparison to the actual residual error data and a Gaussian 
probability density function. 

FIG. 17 lists the root mean square errors for five different 
un-optimized probability density function models computed 
for each of six Space Shuttle Main Engine accelerometers 
used for feasibility testing of a preferred embodiment in 
accordance with the instant invention. 

FIG. 18 lists the root mean square errors for five different 
optimized probability density function models computed for 
each of six Space Shuttle Main Engine accelerometers used 
for feasibility testing of a preferred embodiment in accor- 
dance with the instant invention. 

FIG. 19 illustrates the optimized two-term expansion prob- 
ability density function model of the residual error data for a 
typical Space Shuttle Main Engine accelerometer with com- 
parison to the actual residual error data and a Gaussian prob- 
ability density function. 

FIG. 20 illustrates the empirical false alarm rates for fea- 
sibility testing of a preferred embodiment in accordance with 
the instant invention with comparison to the prior art SPRT 
method. 
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DETAILED DESCRIPTION OF THE INVENTION 

Considering the drawings, wherein like reference numer- 
als denote like parts throughout the various drawing figures, 
5 reference numeral 10 is directed to the system according to 
the instant invention. 

In its essence, and referring to FIGS. 1 and 2, the system 10 
is generally comprised of a method and apparatus for per- 
forming high sensitivity surveillance of a wide variety of 
to assets including industrial, utility, business, medical, trans- 
portation, financial, and biological processes and apparatuses 
wherein such process and/or apparatus asset preferably, has at 
least one distinct measured or observed signal or sequence 
comprised of characteristic data values which are processed 
15 by the system 10 described herein for providing ultra-sensi- 
tive detection of the onset of sensor or data signal degrada- 
tion, component performance degradation, and process oper- 
ating anomalies. The system 1 0 includes a training procedure 
30 carried out on a computer 22 such that a process model 50 
20 of an asset 12 (e.g., a process and/or apparatus) is stored in an 
associated memory means 24 after being learned from his- 
torical operating data using at least one of a plurality of 
computer-assisted procedures in accordance with the instant 
invention. The historical operating data includes a set orrange 
25 of observations from expected or typical operation of the 
asset 12 that are acquired and digitized by a data acquisition 
means 20 and stored in a memory means 24 as training data 34 
by using any combination of electronic data acquisition hard- 
ware and signal processing software 20 known to those hav- 
30 ing ordinary skill in the art, and informed by the present 
disclosure. As will be delineated in detail infra, one hallmark 
of the instant invention is the process model 50 forthe asset 12 
that is derived during the training procedure 30. 

The system 1 0 further includes a surveillance procedure 60 
35 wherein the stored process model 50 is used for high sensi- 
tivity computer-assisted surveillance of the asset 12 for the 
purpose of determining whether a process fault or failure 
necessitates an alarm or control action. The process model 50 
is comprised of a parameter estimation model 52 or collection 
40 of parameter estimation models 52 as necessary to provide an 
estimated value for each sensor or data signal 14 of asset 12 to 
be monitored for the presence of fault conditions during the 
surveillance procedure 60. The process model 50 is further 
comprised of a fault detection model 54 or collection of fault 
45 detection models 54 such that at least one fault detection 
model 54 is provided for each sensor or data signal 14 of asset 
1 2 to be monitored for the presence of fault conditions during 
the surveillance procedure 60. The fault detection model 54 is 
another hallmark of the instant invention and will be delin- 
50 eated in further detail hereinbelow. 

Process Model Training Procedure: 

More specifically, and referring to FIGS. 1 and 3, the 
training procedure 30 of the system 10 includes a method and 
apparatus for training or preparing the process model 50 
55 using historical operating data from the asset 12 that has been 
acquired by the data acquisition means 20 using any combi- 
nation of conventional electronic data acquisition hardware 
and signal processing software as is well known in the art. The 
historical operating data is acquired in digital format and 
60 stored in memory means 24 using a data storage procedure 32 
to create a training data set 34. 

The training data set 34 includes at least N discrete obser- 
vations of the asset 12 wherein each single observation, 
herein denoted Xobs, is comprised of a vector of data values 
65 for each signal parameter to be included in the process model 
50. For the purposes of the training procedure 30, the number 
of observations, N, acquired is at least great enough to 
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adequately bound the operating state space of the asset 12. 
Thus, the training data set 34 provides a representative sample 
of the signals produced by the asset 12 during all normal 
modes of operation. 

Upon acquiring the training data set 34, the unique training 5 
procedure 30 can be implemented in accordance with instant 
invention. 

The unique training procedure 30 is comprised of a cali- 
brate parameter estimator procedure 36 and a calibrate fault 
detector procedure 38. The calibrate parameter estimator pro- to 
cedure 36 creates the parameter estimation model 52 and 
trains the parameter estimation model 52 using the training 
data 34. The calibrate parameter estimator procedure 36 fur- 
ther stores this model as a new parameter estimation model 52 
in the process model 50. 15 

The process model 50 may contain one or more parameter 
estimation models 52 depending upon the requirements of the 
approach taken by a designer. Continuing to refer to FIG. 3, 
the training procedure 3 0 may be, in general, performed using 
any parameter estimation method suitable for defining a 20 
parameter estimation model 52 useful for estimating the val- 
ues of one or more process signals. Methods suitable for the 
calibrate parameter estimator procedure 36 include, but are 
not limited to, a plurality of redundant sensor techniques, a 
plurality of multivariate state estimation techniques, a plural- 25 
ity of neural network techniques, a plurality of mathematical 
model techniques, a plurality of autoregressive moving aver- 
age techniques, and a plurality of Kalman filter techniques. 
Each parameter estimation model 52 contained in the process 
model 50 may be created to implement any one of a plurality 30 
of parameter estimation techniques. Further, the parameter 
estimation technique implemented for an individual param- 
eter estimation model 52 is not constrained to be the same as 
the parameter estimation technique implemented for any 
other parameter estimation model 52 contained in the process 35 
model 50. 

One example of the calibrate parameter estimator proce- 
dure 36 would be the computation of a bias term between two 
redundant sensors wherein the parameter estimation model 
52 used for estimating the value of one sensor during the 40 
surveillance procedure 60 consisted of summing the observed 
value of a second redundant sensor with a bias tenn computed 
during the training procedure 30 as the mean difference 
between the two sensor values over the training data set 34. 
More sophisticated examples of the training procedure 30 45 
using multivariate state estimation techniques will be 
described herein below. 

Still referring to FIG. 3, the calibrate fault detector proce- 
dure 38 makes use of the parameter estimation models 52 to 
provide estimated values for at least one signal parameter 50 
contained in the training data. Generally, the calibrate fault 
detector procedure 38 will create a separate and distinct fault 
detection model 54 for each sensor or data signal of asset 12 
to be monitored for the presence of fault conditions during the 
surveillance procedure 60. As delineated infra, one hallmark 55 
of the instant invention is the fault detection model 54 element 
of the process model 50 for the asset 12 that is derived during 
the training procedure 30. In particular, the instant invention 
encompasses a statistical hypothesis test type of fault detec- 
tion model 54 having novel and unique characteristics and 60 
calibration procedures described herein including but not 
limited to having a probability density function model 
empirically derived from a numerical analysis of asset oper- 
ating data. 

Continuing to refer to FIG. 3, the training procedure 30 is 65 
completed when all training data 34 has been used to calibrate 
the process model 50. At this point, the process model 50 
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includes parameter estimation models 52 and fault detection 
models 54 for each sensor or data signal of asset 12 to be 
monitored for the presence of fault conditions during the 
surveillance procedure 60. The process model array 50 is 
thereafter useful for performing surveillance of the asset 12. 

Referring to FIG. 4, the training procedure 30 is illustrated 
in additional detail. A designer 16 initializes the calibrate 
parameter estimators procedure 36 by specifying a set of 
parameter estimator methods and settings 40. The parameter 
estimator methods and settings 40 are then used to operate on 
the training data 34 via a nominal signal behavior modeling 
procedure 42, for example using an MSET training procedure 
as described herein in detail, in order to create the parameter 
estimation models 52, which are stored in the process model 
50. 

Still referring to FIG. 4, the training procedure 30 next 
proceeds to the calibrate fault detectors procedure 38 wherein 
the parameter estimation models 52 are an input to the pro- 
cedure. The designer 16 initializes the calibrate fault detec- 
tors procedure 38 by specifying a set of fault detector methods 
and settings 46. Next, the estimate process parameters proce- 
dure 64 operates the parameter estimation models 52 over the 
training data 34 to generate an estimated value for each moni- 
tored signal value contained in the training data 34. It is 
important that the estimate process parameters procedure 64 
used in the calibrate fault detectors procedure 38 be the same 
estimate process parameters procedure 64 that will later be 
used in the surveillance procedure 60 (reference FIGS. 1 and 
2 for surveillance procedure 60). Next, A compute training 
data residuals procedure 44 calculates the training data 
residuals for each monitored signal, which are calculated as 
the difference between the training data value and the corre- 
sponding estimated data value for each monitored signal . The 
training data residuals are next used by a compute nominal 
residual probability density function (PDF) procedure 48 to 
create a fault detection model 54 for each monitored signal . In 
accordance with the instant invention, the fault detection 
models 54 is typically comprised of mathematical descrip- 
tions of the probability density function that best character- 
izes or best fits the training data residual for the monitored 
signal. The training data is presumed to accurately character- 
ize the expected normal operating states of the asset 12. 
Therefore, the training data residuals are characteristic of the 
expected normal deviations between the observed signal val- 
ues and the values estimated using the parameter estimation 
models 52. The fault detection models 54 are stored in the 
process model 50 thereby completing the training procedure. 

One hallmark of the instant invention is the method and 
system for computing the fault detection models 54 by the 
means of the compute nominal residual probability density 
function (PDF) procedure 48. As will be described math- 
ematically herein below, the compute no min al residual prob- 
ability density function (PDF) procedure 48 fits a general 
open-ended probability function to the training data residuals 
and employs this fitted function when implementing a herein 
named Adaptive Sequential Probability (ASP) method and 
system for computing the fault detection model 54 and there- 
after employing said fault detection model 54 for the purpose 
of performing a fault detection procedure 66 of the surveil- 
lance procedure 60. 

Surveillance Procedure: 

Referring to FIG. 5, the surveillance procedure 60 is com- 
prised of acquiring successive vectors of current operating 
data and determining for each such observation vector 
whether the current operating data is indicative of a fault or 
failure of the asset 12. The surveillance procedure 60 further 
includes implementing an alarm or control action 70 for the 
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purpose of notifying an operator and/or taking a corrective 
action in response to a detected fault or failure of the asset 12. 
The surveillance procedure 60 is in general an open-ended 
data acquisition and analysis loop that continues until such 
time as the operator chooses to terminate the surveillance 74. 5 

More specifically, and referring to FIG. 5, the surveillance 
procedure 60 begins 58 with an acquire current operating data 
procedure 62 that employs the data acquisition and digitiza- 
tion means 20 (FIG. 1) to acquire a current set of signal data 
from the monitored asset 12. The current set of signal data is to 
provided to the estimate process parameters procedure 64 that 
uses the parameter estimation models 52 to estimate values 
for one or more of the current signal data values . The observed 
and estimated data are next provided to a perform fault detec- 
tion procedure 66 that uses the fault detection models 54 to 15 
determine whether a fault is found 68 in the current operating 
data. If a fault is found 68 is true, an alarm and/or control 
action 70 is taken. Upon completing the fault found procedure 
68, the surveillance procedure 60 then repeats for the next 
available set of signal data for as long as a surveillance com- 20 
plete decision procedure 72 determines that additional sur- 
veillance data are available or terminates at surveillance com- 
plete step 74 when no more surveillance data are available or 
when terminated by an operator. 

Referring now to FIG. 6, the perform fault detection pro- 25 
cedure 66 of surveillance procedure 60 is illustrated in addi- 
tional detail. For each current set of signal data values 
acquired the estimate process parameters procedure 64 uses 
the parameter estimation models 52 to estimate values for one 
or more of the current signal data values. The compute data 30 
residuals procedure 76 performs a mathematical transforma- 
tion on the acquired and estimated values to produce a current 
set of residual data values. Said mathematical transformation 
is most typically a simple mathematical difference, however, 
any appropriate transformation may be used including trans- 35 
formations that remove correlated and uncorrelated noise 
from the residual data values. The residuals produced and 
transformed in the compute data residuals procedure 76 are 
next processed by a perform ASP tests procedure 78 that uses 
the fault detection models 54 to produce a ASP fault indica- 40 
tionthat is a hallmark of the method and system of the instant 
invention. Next, the fault found decision procedure 68 is 
performed on the basis of the ASP fault indication results 
produced by the perform ASP tests procedure 78. The fault 
found decision procedure 68 may have any one of a plurality 45 
of structures and procedures, including but not limited to 
methods and systems to perform false alarm filtering by 
means of observing a time series of ASP fault indication 
results for the purposes of determining the actual presence of 
a fault. In one preferred embodiment of the instant invention, 50 
a conditional probability fault found decision procedure 68 is 
used to perform said false alarm filtering. 

Continuing to refer to FIG. 6, the estimate process param- 
eters procedure 64 uses the parameter estimation models 52 
to estimate values for one or more of the current signal data 55 
values wherein the parameter estimation method used may 
have any one of a plurality of structures and procedures, 
including but not limited to, a plurality of redundant sensor 
techniques, a plurality of multivariate state estimation tech- 
niques, a plurality of neural network techniques, a plurality of 60 
mathematical model techniques, a plurality of autoregressive 
moving average techniques, and a plurality of Kalman filter 
techniques. 

Referring to FIG. 7, one possible redundant sensor tech- 
nique for the estimate process parameters procedure 64 is 65 
illustrated. The acquire current operating data procedure 62 is 
used to acquire current signal data values from signals 14 
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monitored from asset 12 via sensors 18. The estimated value 
for a first redundant sensor signal is computed using a math- 
ematical transformation on the acquired value of a second 
redundant sensor signal. Said mathematical transformation is 
the estimate process parameters procedure 64 that in this case 
may be a simple equivalence or may include biasing, de- 
noising or other signal processing. The compute data residu- 
als procedure 76 is then performed followed by the perform 
ASP tests procedure 78 as described hereinabove and further 
delineated hereinbelow. 

Referring to FIG. 8, one possible multivariable parameter 
estimation technique for the estimate process parameters pro- 
cedure 64 is illustrated. The acquire current operating data 
procedure 62 is used to acquire current signal data values 
from signals 14 monitored from asset 12 via sensors 18. The 
estimated value for one or more sensor signals is computed 
using a mathematical transformation on the acquired values 
of one or more sensor signals. Said mathematical transforma- 
tion is the estimate process parameters procedure 64 that in 
this case may implement any feasible parameter estimation 
technique or procedure, including but not limited to a plural- 
ity of multivariate state estimation techniques, a plurality of 
neural network techniques, a plurality of mathematical model 
techniques, and a plurality of Kalman filter techniques. The 
compute data residuals procedure 76 is then performed fol- 
lowed by the perform ASP tests procedure 78 as described 
hereinabove and further delineated hereinbelow. 

Referring again to FIG. 6, the usefulness of the instant 
invention is, inter alia, the improvement achieved in the accu- 
racy of the fault decision procedure 68 that results from the 
improvement achieved in the accuracy of perform fault detec- 
tion procedure 66 made possible by the novel perform ASP 
tests procedure 78 that is a hallmark of the instant invention. 
Improving the accuracy of the fault decision procedure 68 
accomplishes a reduction in the number of false alarms sent to 
a process operator or control system that can in turn result in 
an erroneous alarm or control action by the alarm or control 
action procedure 70. Further, improving the accuracy of the 
fault decision procedure 68 accomplishes a reduction in the 
number of missed alarms thereby accomplishing more timely 
alann or control action by the alarm or control action proce- 
dure 70. The instant invention thereby enables improved 
operating safety, improved efficiency and performance, and 
reduced maintenance costs for a wide variety of industrial, 
utility, business, medical, transportation, financial, and bio- 
logical processes and apparatuses wherein such process and/ 
or apparatus asset 12 has at least one characteristic data signal 
suitable for surveillance. 

In use and operation, and in one preferred form, FIGS. 1 
and 9 outline a general surveillance procedure of the system 
10 when employing the novel fault detection model 54 con- 
tained in the process model 50 and the accompanying novel 
fault detection procedure 66 having the perform ASP tests 
procedure 78. In a typical surveillance procedure, the asset 12 
is the source of at least one process signal 14 that is acquired 
and digitized using conventional data acquisition means 20 
for providing the data acquisition procedure for the purpose 
of computer-assisted surveillance. The digitized signal data is 
generally evaluated using a computer 22 having computer 
software modules implementing the estimate process param- 
eters procedure 64, and the perform fault detection procedure 
66. The estimate process parameters procedure 64 is used to 
produce an estimated signal value for at least one process 
signal 14 emanating from the asset 12. Hie estimate process 
parameters procedure 64 in general makes use of the process 
model 50 stored in a memory means 24 associated with the 
computer 22 to produce the estimated signal values. The 
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estimated signal values are then generally evaluated using the 
perforin fault detection procedure 66 to identify faults or 
operating anomalies in the asset 12 . The results of the fault 
detection evaluation are thereafter communicated by a con- 
ventional communications means 26 (as is known to those 
having ordinary skill in the art, and informed by the present 
disclosure) to an operator console 80 and/or automated pro- 
cess control system 82 for possible alarm and/or control 
action. 

The computer 22 along with the associated memory means 
24 can also be employed to perform the training and surveil- 
lance procedures 30 , 60 as delineated hereinabove and to 
produce and store all the data associated with these proce- 
dures, for example, the historical training data, designer 
defined settings, and the process model. 

Multivariate State Estimation Technique (MSET) for Esti- 
mate Process Parameters Procedure: 

In one embodiment of the instant invention, the estimate 
process parameters procedure 64 uses a multivariate state 
estimation technique (MSET) procedure having an MSET 
parameter estimation model 52 structure. TheUS Department 
of Energy’s Argonne National Laboratory originally devel- 
oped the implementation of MSET described herein for sur- 
veillance of sensors and components in nuclear power plant 
applications. Elowever, other implementations of a multivari- 
ate state estimation technique are possible and useful in con- 
junction with the instant invention. MSET is in general a 
statistically derived parameter estimation algorithm that uses 
advanced pattern recognition techniques to measure the simi- 
larity or overlap between signals within a defined domain of 
asset operation (set of asset operating states). MSET “learns” 
patterns among the signals by numerical analysis of historical 
asset operating data. These learned patterns or relationships 
among the signals are then used to identify the learned state 
that most closely corresponds with a new signal data obser- 
vation. By quantifying the relationship between the current 
and learned states, MSET estimates the current expected 
response of the asset signals. MSET then uses threshold com- 
parisons or a form of statistical hypothesis testing, such as a 
Sequential Probability Ratio Test (SPRT) as disclosed in U.S. 
Pat. No. 5,459,675 which is hereby incorporated by reference 
in its entirety or the Adaptive Sequential Probability (ASP) 
procedure that is a hallmark of the instant invention to com- 
pare the current estimated value of a signal with its observed 
value. The MSET procedure provides an accurate and widely 
applicable method to estimate the operating signal values for 
an asset. Elowever, other implementations of the parameter 
estimation procedure are possible and useful in conjunction 
with the instant invention. 

An MSET model is created for the asset 12 using the MSET 
training algorithms to learn the inherent data relationships 
within a set of historical asset operating data. The trained 
MSET model is then used with the MSET parameter estima- 
tion and fault detection algorithms to perform the process 
surveillance function when presented with a new observation 
of signal data values. The following sections will first provide 
a mathematical overview of the MSET algorithms and pro- 
cedures useful for training a parameter estimation model and 
for using this trained model for process surveillance. The 
description is followed by a detailed description of a preferred 
embodiment of the instant invention using a novel Adaptive 
Sequential Probability (ASP) procedure for fault detection 
during process surveillance. 

MSET Training and Parameter Estimation Procedures: 

Hie MSET methods are generally described in the follow- 
ing two US Government documents produced and main- 
tained by the US Department of Energy’s Argonne National 
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Laboratory, Argonne, 111., disclosure of which is incorporated 
in its entirety herein by reference. 

J. P. Herzog, S. W. Wegerich, R. M. Singer, and K. C. 
Gross, “Theoretical Basis of the Multivariate State Estima- 
5 tion Technique (MSET),” Argonne National Laboratory, 
ANL-NT-49, December 1997. 

J. P. Herzog, S. W. Wegerich, K. C. Gross, and R. M. 
Singer, “MSET: Code Structure and Interface Development 
Guide,” ANL-NT-48, August 1997. 

10 The MSET algorithm uses pattern recognition with histori- 

cal operating data from an asset to generate a parameter 
estimation model. If data is collected from a process over a 
range of operating states, this data can be arranged in matrix 
form, where each column vector (a total of m) in the matrix 
1 11 represents the measurements made at a particular state. Thus, 
this matrix will have the number of columns equal to the 
number of states at which observations were made and the 
number of rows equal to the number of measurements (a total 
of n signal data values) that were available at each observa- 
20 tion. We begin by defining the set of measurements taken at a 
given time t ; as an observation vector X(t,), 

^(0M*i(0)> *2 ({,). • • • • *Jfj)] T (El) 

25 where x,(y is the measurement from signal i at time t,. We 
then define the data collection matrix as the process memory 
matrix D: 
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Each of the column vectors (X(t,)) in the process memory 
matrix represents an operating state of the asset. Any number 
of observation vectors can be assigned to the process memory 
matrix. Training an MSET model includes collecting enough 
40 unique observation vectors from historical operation of the 
asset during normal conditions such that the process memory 
matrix encompasses the Hill dynamic operating range of the 
asset. Computation of the D matrix is the first of three steps in 
the method for training an MSET model based on historical 
45 operating data. 

One of at least two algorithms is used by MSET to select 
the vectors in the D matrix. The MinMax algorithm extracts 
vectors that bound the vector space defined by the training 
data and returns the smallest process memory matrix that will 
50 produce an effective system model (see also U.S. Pat. No. 
5,764,509 which is hereby incorporated by reference in its 
entirety). A vector ordering algorithm selects and includes 
representative vectors from the inner regions of the vector 
space producing a more accurate system model. 

55 Once the process memory matrix has been constructed, 
MSET is used to model the dynamic behavior of the system. 
For each current observation of the system (Xobs), MSET 
compares the observation vector to the stored operating states 
to calculate an estimate of the process parameter values. The 
60 parameter estimate of the current process state (Xest) is an 
n-element vector that is given by the product of the process 
memory matrix and a weight vector, W: 

X„=-W (E3) 

65 

The weight vector represents a measure of similarity 
between the estimate of the current state and the process 
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memory matrix. To obtain the weight vector, we minimize the 
error vector, R, where: 

R=X^-X„, (E4) 

The error is minimized for a given state when: 

^'=( T ®r l -( T ®X oi J (E5) 

This equation represents a “least squares” minimization 
when the pattern recognition operator @ is the matrix dot 
product. Several advanced pattern recognition operators have 
been defined that provide excellent parameter estimation per- 
formance. Pattern recognition operators used by MSET 
include, but are not limited to, the System State Analyzer 
(SSA) method (see also U.S. Pat. No. 4,937,763 which is 
hereby incorporated by reference in its entirety), the Bounded 
Angle Ratio Test (BART) method (see also U.S. Pat. No. 
5,987,399 which is hereby incorporated by reference in its 
entirety), the Vector Pattern Recognizer (VPR) method, the 
Vector Similarity Evaluation Technique (VSET) method, and 
the Probabilistic State Estimation Method (PSEM). 

Once the weight vector is found, the resulting current state 
estimate of the system (i.e., the parameter estimate vector) is 
given by: 

X es ,=DiD T ®D)-‘-(D T ®X ol , s ) (E6) 25 

The first application of the pattern recognition operator in 
equation E 6 (D r (x) D) involves a comparison between the 
row vectors in the D r matrix and each of the column vectors 
in the D matrix. If we define G=D r (x) D, then G, the similar- 30 
ity matrix, is an m by m matrix. The element in the i-th row 
and j -th column of the matrix (g, .) represents a measure of the 
similarity between the i-th and j-th column vectors (i.e., 
memorized states) in the process memory matrix. The second 
application of the pattern recognition operator in equation E 6 35 
(D r (x)X ofc ) involves a comparison between the row vectors 
in the D r matrix and each of the elements in the observation 
vector Xobs. If we define A=D r (x) X obs , then A, the similarity 
vector, is an m by 1 vector. Each element in the similarity 
vector is a measure of the similarity between the observation 40 
vector and the i-th column vector (i.e., memorized state) in the 
process memory matrix. 

Note that the similarity matrix is a function of the process 
memory matrix only. Thus, the similarity matrix and its 
inverse Ginv=(D r (x)D )" 1 can be calculated as soon as the 45 
process memory matrix has been derived thereby making the 
application of MSET to an on-line surveillance system more 
computationally efficient. Computation of the Ginv matrix 
initializes the parameter estimation model and completes the 
second of three steps in the procedure for training an MSET 50 
model based on historical operating data. 

The third and final step in the MSET training procedure 
includes analyzing the historical training data using equation 
E4 to produce a residual error vector, R, for each observation 
vector in the training data. The collection of residual error 55 
vectors comprises the training data residuals necessary for 
training the fault detection model 54 using any one of a 
plurality of techniques, including but not limited to the SPRT 
technique, and the novel ASP technique that is a hallmark of 
the instant invention. 60 

Hie Sequential Probability Ratio Test (SPRT) technique is 
a statistical hypothesis test fault detection algorithm histori- 
cally used for MSET process surveillance. The SPRT tech- 
nique is described in U.S. Pat. No. 5,459,675, which is incor- 
porated herein by reference in its entirety. The SPRT analyzes 65 
a sequence of discrete residual error values from a signal to 
determine whether the sequence is consistent with normal 
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signal behavior or with some other abnormal behavior. When 
the SPRT reaches a decision about the current signal behav- 
ior, e.g., that the signal is behaving normally or abnormally, 
the decision is reported and the test continues analyzing the 
signal data. For any SPRT, signal behavior is defined to be 
normal when the signal data adheres to a Gaussian probability 
density function (PDF) with mean 0 and variance a 2 . Normal 
signal behavior is referred to as the null hypothesis, H 0 . 
MSET employs four specific SPRT hypothesis tests. Each test 
determines whether current signal behavior is consistent with 
the null hypothesis or one of four alternative hypotheses. The 
four tests are known as the positive mean test, the negative 
mean test, the nominal variance test, and the inverse variance 
test. For the positive mean test, the corresponding alternative 
hypothesis, H x , is that the signal data adhere to a Gaussian 
PDF with mean +M and variance a 2 . For the negative mean 
test, the corresponding alternative hypothesis, H 2 , is that the 
signal data adheres to a Gaussian PDF with mean -M and 
variance a 2 . For the nominal variance test, the corresponding 
alternative hypothesis, H 3 , is that the signal data adheres to a 
Gaussian PDF withmeanOandvarianceVcr. Forthe inverse 
variance test, the corresponding alternative hypothesis, H 4 , is 
that the signal data adheres to a Gaussian PDF with mean 0 
and variance c r/V. The user-assigned constants M and V 
control the sensitivity of the tests. 

Limitations of the SPRT Fault Detector Training and Sur- 
veillance Method and System: 

One significant shortcoming of the SPRT technique is 
found in the assumptions underlying its mathematical formu- 
lation. Specifically, the SPRT technique presumes that the 
residual error signals adhere to a Gaussian probability density 
function. For residual error signals that are non-Gaussian, the 
fault detector false alarm rates and/or missed alarm rates 
specified by the designer are not accomplished by the SPRT 
procedure thereby degrading the fault decision accuracy of 
the asset control and/or surveillance system. The novel ASP 
technique of the instant invention specifically removes the 
assumption that the residual error signals adhere to a Gauss- 
ian probability density function. The ASP technique imple- 
ments any one of a plurality of methods to numerically fit a 
probability density function to the observed residual error 
signal distribution that is characteristic of normal asset opera- 
tion. The derived probability density function is then used to 
perform a dynamic statistical hypothesis test thereby achiev- 
ing the fault detector false alarm and missed alarm rates 
specified by the designer and improving the fault decision 
accuracy of the asset control and/or surveillance system. 

Fault Detection Using Statistical Hypothesis Test Proce- 
dures: 

The general theory underlying the statistical hypothesis 
test will now be delineated below. Next, the SPRT implemen- 
tation of a dynamic statistical hypothesis test will be 
described. Finally, the novel ASP implementation of a 
dynamic statistical hypothesis test for non-Gaussian residual 
error signals will be delineated in detail along with a delin- 
eation of its reduction to practice. 

Bayes’ Rule for a Single Observation: 

Statistical decision problems in which there are just two 
possible outcomes constitute an important class called binary 
hypothesis testing problems. The possible states of a system 
are called hypotheses and each individual state of the system 
is termed a simple hypothesis. A simple hypothesis is a com- 
plete specification of a probability distribution for the system 
(i.e., the distribution of possible observations or samples from 
the system). The “hypothesis” being tested is that the particu- 
lar distribution is the correct one. 
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The basic operation in a binary hypothesis test is to evalu- 
ate the veracity of a hypothesis, H, given a piece of evidence 
or observation, e, from the system being studied. Because of 
the unpredictability or element of chance inherent in the 
system, the test deals with the probabilities that events occur 5 
or that hypotheses are true. The probability that a hypothesis 
is true given a piece of evidence is written as P(Hle). The 
notation identifies a conditional probability — namely the 
probability that the hypothesis is true under the condition that 
the event has occurred with absolute certainty. Bayes’ well to 
known inversion rule, as described in Probabilistic Reasoning 
in Intelligent Systems: Networks of Plausible Inference, by 
Judea Pearl, Morgan Kaufmann Publishers, Inc., San Mateo, 
Calif., 1988, Second Edition at pages 29 through 39, provides 
a means for evaluating the conditional probability of a 15 
hypothesis, P(Hle), which is also known as the posterior 
probability, 


A- H\e) = 


Pie I - H) 

P(e | H)P(H) + P(e | H)P(-^ H ) ' 


(E12) 


An alternate form of Bayes’ rule is produced by dividing it 
(i.e., equation El 1) with its complementary form (i.e., equa- 
tion El 2) to obtain 


Pin I e) Pie | H)P[H) (E13) 

P(^H\e) P(e | -i H)F\-> H) 


This form of Bayes’ rule is further manipulated by first 
defining the prior odds on hypothesis H as 


P(H | e) = 


Pje | H)P(H) 
Pie ) 


(E7) 20 


0(H) = 


P(H ) 
P(^HY 


(E14) 


P(elH) is the probability that the observation would occur 
if the hypothesis is true. P(H) is the probability that the 
hypothesis is true before any observations of the system have 
been made, also known as the prior probability. The denomi- 
nator, P(e), is the unconditional probability that the observa- 
tion occurs. 

Basic axioms of set theory can be used to prove the follow- 
ing identity for two events R and S: R=(RnS)U(Rn _l S), 
where _1 S is the converse of event S. In probability theory, the 
analog of this identity is 

P(R)=P(R,S)+P(R,~'S), (E8) 

where the notation P(R,S) is used represent the probability 35 
of the joint event RflS. The multiplication law states that the 
probability of two events occurring jointly can be expressed 
as a function of the conditional probability of one event based 
on the other, 

40 

P(R,S)=P(R\S)P(R). (E9) 

If R and S are independent events, then P(RIS)=P(R) and 
the multiplication law simplifies to P(R,S)=P(R) P(S). 

Bayes’ rule can be simplified by eliminating the denomi- 
nator in equation E7. Combining equations E8 and E9, and 45 
substituting e for R and H for S, the denominator can be 
written as 

P(e)=P(e\H)P(H)+P(e\~ l H)PC'H) (E 10 ) 

Therefore, Bayes’ rule becomes 50 


P(e\H)P(H) (Ell) 

P(e | H)P(H) + P(e p H)PCH ) ' 


The power of Bayesian techniques comes primarily from 
the fact that in causal reasoning the relationship P(elH) is 
local, namely, given that H is true, the probability of e can be 
estimated naturally and is not dependent on many other 60 
propositions. For instance, given that the measurements from 
an experiment adhere to a particular PDF, the probability that 
any single measurement will occur is easily computed. 

The complementary form of Bayes’ rule provides the pos- 
terior probability for the converse of the hypothesis. It is 65 
evaluated by substituting “'ll for each instance of H in equa- 
tion El 1 and noting that _ '( _ 'H)=H:] 


the likelihood ratio as 


Ue\H) = 


Pie I H) 

P(e | ’ 


and the posterior odds on H as 


(E15) 


0(H | e) = 


P(H | e) 
P(-i H \e)‘ 


(El 6) 


Bayes’ rule then specifies the posterior odds as the product 
of the likelihood ratio and the prior odds, 

0(H\e)=L(e\H)0(H) (E17) 

Bayes’ Rule for a Time Series: 

The formulation above specifies Bayes’ rule for a single 
observation and a binary hypothesis. For the application of 
binary hypothesis tests to real world signals, the formulation 
must be able to handle a sequence of discrete observations. 
This is accomplished by beginning with a single observation 
and successively updating Bayes’ rule for each successive 
observation. Let the sequence {Y„} be an ordered set of n 
elements. {Y re }=y 1; y 2 , , y n . in which the elements are 

observations of the signal made at n discrete moments in time 
such that t 1 <t 2 < . . . <t„. Bayes’ rule for the first observation 
(yj in the time series is 


P(H\yO = 


Piy i I H)P(H) 
Piyi) 


(El 8) 


Adding the second observation from the time series, 
Bayes’ rule for the joint event y i Cry 2 is 


P{H\y u y 2 ) = 


P(yi.y 2 \H)P(H) 

Piyi.y 2 ) 


(El 9) 
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With the aid of the multiplication law (equation E9), the 
joint event probabilities are converted to conditional prob- 
abilities so that the right hand side of this equation can be 
rewritten as 


„ „ , P(yz I H)P(y 1 | H)P(H) (E20) 

P(H yi, > 2 ) = s , — i — rs — ; ■ 

Piyi I yi)P(yi) 

Note that the probability of the joint event y j Oy 2 is written 
as P(y 15 y 2 )=P(y 2 lyi) p (yi) instead of the equivalent P(y,, 
y 2 )=P(y 1 ly 2 )P(y 2 ) because of the temporal dependency of the 
data. The second form of the multiplication law reduces to 
P(y i , y 2 ) = P(y i)P(y 2 ) because earlier events (e.g., y ,) in a time 
series cannot be dependent on later events (e.g., y 2 ). 

The multiplication law is used for each successive obser- 
vation in the time series to derive the fomi of Bayes’ rule for 
the joint event (y 1 ('ly 2 n . . . fly M ): 

P(H\y u y l y„)= (E21.1 

P(yn\yn-i,... ,yi,H) ... P(y 2 l.vi. »)P(.vi I hiP(H) 

P(yn I y n -i yi) ■■■ P(yz\yi)P(yi) 

Since the probabilities for each of the later observations in 
the time series are conditioned on earlier observations, 
Bayes’ rule is difficult to solve in the general case. But if the 
observations in the series are independent of each other (i.e., 
random), then the probabilities will be dependent on the 
hypothesis only. In this case the conditional probability for 
the ith observation, P(y ! ly,._ 1 , . . . , y 1; H), is simply P(y,IH). 
Thus, Bayes’ rule for independent observations in a time 
series is 


" (E24) 

In [0(H \yi,y 2 , ... , ,V„)1 = ln[0(//)] 4- ^ ln[A.V, I H)] 

: = l 

5 

Sequential Hypothesis Tests: 

Wald first presented and studied the following sequential 
test of a simple hypothesis against a simple alternative, as 
to described in Sequential Analysis, by A. Wald, Joint Wiley and 
Sons, Inc., New York, 1947. Let H 0 be a specific probability 
density function called the null hypothesis. Then the prob- 
ability that the time series { Y„} contains samples drawn from 
H 0 is P(y,, y 2 , . . . , y„IH 0 ). Let H^. be a different probability 
15 density function called the alternative hypothesis. Then the 
probability that the time series {Y„} contains samples drawn 
from H t is P(y l5 y 2 , . . . , y„IH ). Two threshold limits A and B 
are chosen, withA<B, and after each observation in the series 
the following statistic (A.^) is calculated: 

20 

= A.v.-.vz, ... ,y„\ Hj) (E25) 

J '“ Piyi,y 2 y n \H 0 )' 

25 

The test procedure is then as follows. If the statistic is 
greater than or equal to the upper threshold limit (i.e., 
A^lgB), then a decision is made to accept hypothesis H 7 as 
true. If the statistic is less than or equal to the lower threshold 
30 limit (i.e., A 2 „S A), then a decision is made to accept hypoth- 
esis H 0 as true. If the statistic falls between the two limits (i.e., 
A<A y „<B), then neither hypothesis can yet be accepted to be 
true and sampling continues. If the observations in the series 
are independent random variables, then the test statistic 
35 reduces to a product of conditional probabilities: 


P(H\yi,y 2 y„) = 


(E22) 


P(y n | H) ... P(yi | H)P(tf) = A P(y,- 1 H I 

P(.v„) ... A.Vi ) {J PW 40 



P(y, I Hj) 
P)y, I Ho)’ 


(E26) 


If an explicit time dependency can be established for the 
observations in the time series (i.e., a function is found that 
relates earlier events to later events), then the general form of 45 
Bayes’ rule (equation E21) can be used to develop failure 
detectionmodels for serially-correlated signals. However, the 
residual signals formed by the difference between the 
observed and estimated signal values are in general random 
signals; thus, Bayes’ rule for random time series is used as the 50 
basis for the fault detection models. 

Dividing by the complementary form of Bayes’ rule for 
random time series and utilizing the definition of the posterior 
odds, prior odds, and likelihood ratio from above, an alternate 
form of Bayes’ rule for a time series is developed: 55 


0{H\y l ,y 2 ... , y„) = 0(H)Y] Uyi I H). 


(E23) 


60 


If we take the logarithm of this equation, the incremental 
nature of Bayesian formulation becomes more apparent. 
Equation E24 shows the log of the likelihood ratio as a 
weight, carried by each observation in the sequence, which 65 
additively sways the belief in the hypothesis one way or the 
other. 


In the sequential hypothesis tests, the logarithm of the test 
statistic is often easier to work with: 


InA j,n = 



P(yi I Hj) ~ 
P(y,\Ho)/ 


(E27) 


The sequential hypothesis test consists of calculating the 
logarithm of the test statistic for each observation in the series 
and comparing the result to the logarithms of the lower and 
upper threshold limits. 

The statistic in the sequential hypothesis test is a product of 
a sequence of likelihood ratios. Each term in the product is the 
ratio of a probability conditioned on one hypothesis to a 
probability conditioned on a second hypothesis. The differ- 
ence between the likelihood ratios in the sequential hypoth- 
esis tests and those in Bayes’ rule (see equations El 5 and 
E23), is that in Bayes’ rule the probabilities are conditioned 
on a hypothesis and its converse, whereas in the sequential 
hypothesis tests the probabilities are conditioned on two 
hypotheses from a set of related hypotheses. In principle, the 
two hypotheses in the sequential hypothesis tests could be the 
converse of each other (i.e., the set contains two elements), 
but in practice the hypotheses are selected from an infinite set 
of exhaustive and mutually exclusive hypotheses. For 
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instance, suppose the null hypothesis is a Gaussian PDF with 
a mean of 1 and a variance 10. Then a sequential hypothesis 
test can be defined in which the alternate hypothesis is any 
Gaussian PDF in which the mean is not 1 and/or the variance 
is not 10. 5 

It is informative to compare the sequential hypothesis tests 
to Bayes’ rule for a time series. Given a null hypothesis Fl 0 
and an alternative hypothesis H^, the likelihood ratio for an 
observation e conditioned on the two hypotheses is defined as 


Lj(e | Ho) = 


P(e | Hj) 

Pie | Ho)' 


(E28) 


The subscripts emphasize the fact that the two hypotheses 
are selected from an infinite set of related hypotheses. The 
prior odds for the two hypotheses are defined as 


Oj(H 0 ) ■■ 


PiHj) 
Pi Ho)' 


(E29) 
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while the posterior odds are defined as 


25 


OjiHo | e) = 


PiHj 1 e) 
Pi «ok)' 


(E30) 


Assuming the observations in the time series are indepen- 
dent, then Bayes’ rule conditioned on hypothesis H ; (i.e., 
equation E22 with the symbol H replaced by H ; ) can be 
divided by Bayes’ rule conditioned on hypothesis H 0 to pro- 
duce Bayes’ rule for a sequential hypothesis test: 


n 

OjiHo | yu yi yn) = Oj(W 0 )P[ Ljiy, \ Ho). 


(E31) 
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hypothesis test makes a decision to accept the null hypothesis 
FI 0 as true when the alternative hypothesis H^. is true, then an 
error of type II is said to have occurred. Although the desig- 
nation is arbitrary, it stems from situations in which one kind 
of error is more serious than the other. Since the normal usage 
of the sequential hypothesis tests is to detect a change in 
signal response from its normal behavior (i.e., hypothesis H 0 ) 
to some abnormal behavior (i.e., hypothesis FI A the error of 
accepting I \ f when H 0 is true is the more serious error. The 
probability that a decision to accept hypothesis H is errone- 
ous is denoted by a. A type I decision error is also called a 
false alarm and the probability of a type I error is called the 
false alarm probability. The probability that a decision to 
accept hypothesis Fl 0 is erroneous is denoted by p. A type II 
decision error is also called a missed alarm and the probability 
of a type II error is called the missed alarm probability. 

The sequential hypothesis tests are open-ended. The tests 
will continue to collect observations from the system and 
update the test statistic until the test statistic satisfies one of 
the two decision conditions. In principle, the number of 
observations needed to reach a decision can be any positive 
integer, although it can be shown that a decision will be 
reached in a finite number of observations. Since the number 
of observations needed to make a decision is indeterminate, 
the probability that a decision is erroneous is found by sum- 
ming the probability of an erroneous decision being made 
after 1 observation, 2 observations, and so on. Formally, in 
terms of the threshold limits A and B that define the test, the 
false alarm probability is given by: 

u=PiAj I ssB\H 0 )+PiB<Ajj<4,A Jt2 B\H 0 ) (E33) 

The first term in the sum is the probability that the test 
statistic drops below the lower threshold limit after only one 
observation given that the alternative hypothesis is true. The 
second term is the probability that the test statistic drops 
below the lower threshold limit after two observations given 
that the alternative hypothesis is true. Similarly, the missed 
alarm probability is given by: 

$=P(A M SA\Hj)+P(B<Aj a <A,A ja SA\H J )+ .... (E34) 


Dividing this equation through by the prior odds, it 
becomes apparent that the statistic in the sequential hypoth- 
esis test is just the ratio of the posterior odds to the prior odds: 


These expressions are by no means easily computed. 
Moreover, one could not hope to solve these equations for A 
and B in terms of given a and (1, despite the desirability of 
being able to do so in setting up a test to provide a specified 
protection. Although these equations cannot be solved, it can 
be shown that the error probabilities and the threshold limits 
are related by the following inequalities 
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OjiHo | yi,y 2 , ... , y„) 
OjW 


= f] Lj(y, | W 0 ) 3 A j, 


(E32) 


The prior odds are the ratio of the probability that the 
alternative hypothesis is true to the probability that the null 
hypothesis is true, before any data have been collected from 
the system. In many cases, these probabilities are not 
known — no prior information about the system is known. In 
the absence of prior information about the system, these 
probabilities are taken to be Vi , making the prior odds equal to 
1 . Thus in the absence of prior information, the test statistic 
A Ul is equal to the odds that the system adheres to the alter- 
native PDF as opposed to the null PDF. 

Error Probabilities for the Sequential Hypothesis Tests: 

Because the sequential hypothesis tests make decisions 
based on probabilities, there is always a finite probability that 
a decision reached by the test is erroneous. If a sequential 
hypothesis test makes a decision to accept the alternative 
hypothesis H ' ■ as true when the null hypothesis H 0 is true, then 
an error of type I is said to have occurred. If a sequential 


1 p (E35) 

2010 — ^ — . 


The error probabilities and the threshold limits are related 
55 by inequalities because the test statistic K j n does not usually 
attain exactly the value A or the value B when the test is 
completed. But since a decision is declared as soon as an 
observation drives the test statistic past either threshold, the 
inequalities are almost equalities. Indeed, in practice, A and B 
60 are taken to be equal to a/(l-|3) and (1 — a)/(3, respectively. 
Doing so, of course, means that the sequential hypothesis test 
actually carried out has error probabilities that are somewhat 
different than those specified. Let a' and (f denote the empiri- 
cal error probabilities actually attained by a test using speci- 
65 lied threshold limits of A=a/(1-|3) and B=(l-a)/fl. Then 
according to the inequalities in equation E35, the empirical 
error probabilities (i.e., a' and (!') are related to the preas- 
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signed error probabilities (i.e., the values of a and (1 used to 
specify the threshold limits) by 


or' ^ a ^ ft' 1 ft (E36) 5 

a^W) - A = J^p) wd -~b = (T^o)' 


Multiplying these through to eliminate denominators, one 
obtains 

a'-a'psa-ap' and p'-ap'gp-a’p. (E37) 

Adding these two equations together, one obtains an 
inequality relating the empirical error probabilities to the 
preassigned error probabilities: 

a'+P'ga+p. (E38) 

One of the key features of the hypothesis test technique is 
that the designer can specify the error probabilities. This is 
particularly important for type 1 errors, because false alarms 
can cause an operator to make an incorrect decision. Type II 20 
errors typically do not lead to incorrect decisions. This is 
because in the event that a real failure does occur, missed 
alarms may delay the time to detect the failure but not the 
ability to detect the failure. The result above (equation E38) 
shows that the preassigned false alarm probability is not a 25 
strict upper limit for the empirical false alarm probability. 
Similarly, the preassigned missed alarm probability is not a 
strict upper limit for the empirical missed alarm probability, 
even when the hypothesis test is applied to purely random 
data. It is the sum of the preassigned error probabilities that is 30 
an upper limit for the sum of the empirical error probabilities. 
Thus it is possible with purely random data for one of the 
empirical error probabilities to exceed its corresponding pre- 
assigned error probability, but both empirical error probabili- 
ties cannot be greater than their corresponding preassigned 35 
error probabilities. 

True upper bounds for the empirical error probabilities can 
be determined from the inequalities in equation E36, 


( 1 -/ 3 ') ( 1-/31 


and ft <, 


P 


P 


(l -a') (1 — or) ' 
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(E39) 


For small preassigned error probabilities, the true upper 45 
bounds are only slightly greater than the preassigned error 
probabilities. For instance, if both of the preassigned error 
probabilities are 0.01 then the empirical error probabilities of 
the test will not exceed 0.0101 . 

The Sequential Probability Ratio Test (SPRT): 50 

The sequential hypothesis tests described herein above are 
general statistical tests valid for any pair of related hypoth- 
eses. MSET has historically employed four specific sequen- 
tial hypothesis tests to detect signal faults. These four tests are 
called the Sequential Probability Ratio Tests, or SPRTs. The 55 
SPRTs monitor for changes in the statistical characteristics of 
the residual signals. A residual signal is the difference 
between an actual signal and MSET’s estimate of that signal. 
The SPRTs continually monitor a residual signal, generating 
sequences of decisions. A decision in which the null hypoth- 60 
esis is accepted (i.e., A . „=A ) is called a normal decision and 
implies that the residual signal is behaving as anticipated. A 
decision in which the alternative hypothesis is accepted (i.e., 

Aj : : „=B) is called a fault decision and implies that the residual 
signal is behaving abnormally. 65 

Hie null hypothesis upon which the SPRT s are based speci- 
fies that the residual signal consists of Gaussian data that have 
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a sample mean of 0 and a sample variance of c r. A training 
procedure, during which the system is operating normally, is 
used to verify that the mean of the signal is 0 and to evaluate 
the variance of the signal. Note that if the residual signal does 
not have a mean of 0, the calculated mean from the training 
phase is used to normalize the residual signal for the surveil- 
lance procedure using the model. Thus the null hypothesis, 
H 0 , for the SPRTs is that the signal being analyzed adheres to 
Gaussian PDF (N(y; u. cr 2 )), 


N(y; /i, cr 2 ) = - 


'V77 




(E40) 


for which the mean (p) is 0 and the variance is a 2 . Thus, the 
conditional probability that a discrete observation (y ; .) occurs 
given the null hypothesis is expressed as 

P(y^H o )=N(y i ;0,a 2 ) (E41) 

The four SPRT tests historically used with MSET are the 
positive mean test, the negative mean test, the nominal vari- 
ance test, and the inverse variance test. For the positive mean 
test, the corresponding alternative hypothesis, H 1: is that the 
signal data adhere to a Gaussian PDF with mean +M and 
variance cr, where M is the preassigned system disturbance 
magnitude for the mean test. For the negative mean test, the 
corresponding alternative hypothesis, H,, is that the signal 
data adheres to a Gaussian PDF with mean -M and variance 
cr. Referring to FIG. 10, the hypotheses for the two mean 
tests are illustrated for M=2 and cr=l . For the nominal vari- 
ance test, the alternative hypothesis, H 3 , is that the signal data 
adheres to a Gaussian PDF with mean 0 and variance Vcr, 
where V is the preassigned system disturbance magnitude for 
the variance test. Forthe inverse variance test, the correspond- 
ing alternative hypothesis, H 4 , is that the signal data adheres 
to a Gaussian PDF with mean 0 and variance crW. Referring 
to FIG. 11 . the hypotheses for the two variance tests are 
illustrated for V=2 and a 2 = 1. The conditional probabilities 
that a discrete observation (y,) occurs given one of the four 
alternative hypotheses are expressed as 


P(yi | H , ) = N(yr, M, a 2 ), 


(E42) 


P(y, I H 2 ) = N(y,-, -M, cr 2 ). 


P(y, | tf 3 ) = N(yr, 0, Vcr 2 ), 
P(y, | H 4 ) = /vjy,-; 0, y j. 


The logarithm of the test statistic for the four SPRT tests 
can be evaluated by substituting the conditional probabilities 
from equations E41 and E42 into the general fonnula (equa- 
tion E27) and simplifying. Thus, for the positive mean test, 


lnA u = 



N(yy, M , cr 2 ) 
Niy;; 0, cr 2 ) 



In 


expf-^-M) 2 ) 



(E43) 
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-continued 


E yf - (y: - M ) 2 

2cr 2 



= SPRT p OT . 


The equation B43 defines the SPRT index for the positive 
mean test (SPRT ' ). A SPRT index is defined for each of the 

four sequential hypothesis tests. The SPRT indices are the 
actual quantities computed by the SPRT fault detection pro- 
cedure. The SPRT index for the negative mean test (SPRT n(g ) 
is given by 
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The SPRT index for the nominal variance test (SPRT„ om ) is 
given by 
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The SPRT index for the inverse variance test (SPRT jHV ) is 
given by 


alarm probabilities may not be achieved. The novel advanced 
fault detection method of the instant invention, herein named 
the Adaptive Sequential Probability (ASP) technique or 
method, solves the problem associated with non-Gaussian, 
5 non- white residual signals. The ASP method uses binary 
hypothesis tests that are numerically timed to better accom- 
modate the non-Gaussian data distributions that are typical 
for residual error signals produced by parameter estimation 
based techniques, such as MSET. 

1° Mathematical Foundations of the ASP Test: 

The Adaptive Sequential Probability (ASP) method 
defines four new sequential hypothesis tests. The Adaptive 
Sequential Probability method is an advanced failure detec- 
tion technique that broadens the domain of applicability of 
15 the SPRT technique to non-Gaussian PDFs. In the ASP 
method, the assumption that the data fit a Gaussian PDF is 
relaxed and the test statistic is evaluated for any arbitrary data 
distribution. In the ASP method, the signal is assumed to 
consist of random observations that adhere to a specific PDF 
20 that is a function of the sample mean, variance, and possibly 
higher order terms. The PDF is denoted by the general func- 
tion 3(y; p, cr, . . . ). The parameter list of the function is 
open-ended to indicate that additional terms, such as the 
sample skewness, kurtosis, or width of the distribution at 
, . half-maximum, may be required to characterize the function. 
The null hypothesis upon which the ASP tests are based 
specifies that the data distribution has a sample mean of 0 and 
a sample variance of cr. A training phase, during which the 
system is operating normally, is used to verily that the mean 
3Q of the signal is 0 and to evaluate the sample variance. If the 
PDF is dependent upon any other additional terms, they are 
also evaluated during the training phase for numerically tun- 
ing one or more probability functions obtained by fitting an 
open-ended general probability functions to one or more data 
distributions obtained during typical or normal operating 
35 conditions of the asset under surveillance. Thus, the condi- 
tional probability that a discrete observation (y ; ) occurs given 
the null hypothesis is expressed as 

Ptv.-LffohSfe 0 , o 2 , . . . ). (E47) 
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Novel Improvements to the Fault Detector Training and 
Surveillance Procedures: 

Having described the SPRT fault detector training and 
surveillance methods herein above, the section below 
describes the novel improvements made by the instant inven- 
tion when used for fault detector training and surveillance, 
said improvements being applicable to any asset preferably 55 
having at least one characteristic data signal. 

The Adaptive Sequential Probability (ASP) Test: 

Hie SPRT technique is valid only for residual signals that 
are random and adhere to Gaussian distributions. Because the 
MSET parameter estimation algorithm simulates the deter- 60 
ministic components of a set of inter-related raw signals, the 
residual signals that are produced within MSET are often 
white-noise processes (i.e., not serially-correlated). Although 
the MSET residual signals are typically white, they are often 
non-Gaussian (not normally distributed). If the SPRT fault 65 
detection model is applied to signals that are either non- 
Gaussian, non- white, or both, the theoretical false and missed 



Typically, four sequential hypothesis tests (i.e., a positive 
mean test, a negative mean test, a nominal variance test, and 
a inverse variance test) are utilized by the ASP method. For 
the positive mean test, the corresponding alternative hypoth- 
esis, Hj, is that the signal data adhere to the specified PDF 
with mean +M and variance cr where M is a preassigned 
system disturbance magnitude for the mean test. For the nega- 
tive mean test, the corresponding alternative hypothesis, H 2 , 
is that the signal data adheres to the specified PDF with mean 
-M and variance a 2 . For the nominal variance test, the cor- 
responding alternative hypothesis, H 3 , is that the signal data 
adhere to the specified PDF with mean 0 and variance Vcr 2 
where V is a preassigned system disturbance magnitude for 
the variance test. For the inverse variance test, the correspond- 
ing alternative hypothesis, H 4 , is that the signal data adhere to 
the specified PDF with mean 0 and variance cr/V. The con- 
ditional probabilities that a discrete observation (y,.) occurs 
given one of the four alternative hypotheses are expressed as 
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Hie logarithm of the test statistic for the four ASP tests can 
be evaluated by substituting the conditional probabilities 
from equations E47 and E48 into the general formula (equa- 
tion E27). The logarithm of the test statistic for a given test is 
defined to be the ASP index for that test. The ASP index for 5 
the positive mean test (ASP pos ) is given by 


28 

A PDF can be written as a sum of a standard normal 
distribution, Z(x), and a remainder term, R(x), as described in 
Mathematical Methods of Statistics, by H. Cramer, Princeton 
University Press, Princeton, N.J., 1946. 

3(x)=Z(x)+R(x), (E53) 

where the standard normal PDF 
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The ASP index for the negative mean test (ASP„ eg ) is given 
by 


15 


is a function of the dimensionless variable x=(y-p)/a. The 
standard normal PDF is related to the general Gaussian PDF 
(equation E40) through the normalization condition: 
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The ASP index for the nominal variance test (ASP„ om ) is 
given by 
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The ASP index for the inverse variance test (ASP ZWV ) is 
given by 
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The ASP tests are then implemented in the same manner as 
the SPRT tests. Namely, for each time step in the calculation, 
the four ASP indices are calculated (equations E49 through 
E52). Each ASP index is compared to the upper and lower 
log-thresholds and the status of the test is evaluated (i.e., 45 
faulted, normal, or continuing). 

ASP Method for Near-Gaussian Distributions: 

The Adaptive Sequential Probability method consists of 
four specific sequential hypothesis tests applied to non-Gaus- 
sian data distributions. In order to use the method, a general 50 
PDF, 3(y; p; cr, . . . ), must be defined for the target signal. In 
this section, the ASP method is derived for data distributions 
that are nearly Gaussian. 

In applications of MSET to the Space Shuttle Main Engine 
accelerometer signals, applicants discovered that the residual 55 
signals produced by the system model have a nearly, but not 
truly, Gaussian behavior, as described in Dynamics Sensor 
Data Validation Phase I Final Report, by applicants: Randall 
L. Bickford and James P. Herzog, NASA Contract NAS8- 
40874, Jul. 1 , 1 997. When plotted as a histogram, the residual 60 
data will appear to have the same bell -curve shape as a Gaus- 
sian distribution. But when a Gaussian PDF of the same 
mean, standard deviation, and area as the data distribution is 
superimposed on it, the histogram was found by applicants to 
be non-Gaussian. Typically, applicants found that the histo- 65 
gram has thicker tails than the Gaussian curve, which corre- 
sponds to a sample kurtosis that is greater than 3. 


The remainder term can be expressed as an infinite series of 
orthogonal polynomials whose terms are functions of deriva- 
tives of the standard normal PDF. The constant coefficients of 
the polynomial terms are dependent on the central moments 
of the target distribution. As described in Handbook of Math- 
ematical Functions, by M. Abramowitz and I. A. Stegun, 
Dover Publications, Inc., New York, 1 972, the first four terms 
in the series expansion of the remainder function are: 
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The jy factors are central moments of the discrete data 
sequence {Y„}=(y 1; y 2 , . . . , yj, 

If 1 fk < E57 1 
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The Z, (n> (x) functions are derivatives of the standard normal 
PDF. The nth derivative of the standard normal PDF is given 
by: 


Z'"{x) S ^Z(jt) = (-1 ) n He„{x)Z{x), 
(hr 
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where He„(x) are the Hennite polynomials. The first 
twelve Hermite polynomials are 

He 1 (x)=x, 

He 2 (x)=x 2 -1, 


-continued 
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He 3 (x)=x 3 -3x, 

He 4 (x)=x 4 -6x 2 +3, 

He 5 (x)=x 5 - 1 Ox 3 + 1 5x, 

He 6 (x)=x 6 - 1 5x 4 +45x 2 - 1 5 , 
He 7 (x)=x 7 -21x 5 +105x i - 1 05x, 
He 8 (x)=x 8 - 28x 6 +2 10x 4 -420x 2 +l 05 , 
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15 Thus, the series expansion formula for the approximation 
of the general PDF is 


/7e 9 (x)=x 9 -36x 7 +378x 5 -1260x 3 +945x, 

He lo (x)=x lo -45x 8 +630x 6 -3150x 4 +4725x 2 -945, 
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i7e u (x)=x 11 -55x 9 +990x 7 -693at 5 +17325x 3 -10395x, 

He 12 (x)=x i2 -66x lo +14&5x s -13&60x 6 +51975x 4 - 

6237Qx 2 +10395. (E59) 

Each Term in the Series Expansion of the Remainder Func- 
tion R(x) is More complex than the previous term as higher 
central moments of the data distribution and higher deriva- 30 
fives of the standard normal PDF are introduced. The nth term 
in the remainder function depends on the central moments 
from order 1 (i.e., mean) through order n+2. Thus, the first 
term depends on moments through the third central moment 
(i.e., skewness), the second term depends on moments 35 
through the fourth central moment (i.e., kurtosis), and so on. 

If the data distribution that is to be approximated with 
equation E53 is nearly Gaussian, a good approximation will 
be achieved with only a few of the terms in the series. In the 
ASP method for near-Gaussian distributions, the data distri- 40 
bution is first approximated with the standard nomial PDF, 
Z(x). The approximation is refined by adding successively 
higher terms of the remainder function until an adequate fit 
between the general PDF, 3(x), and the data distribution is 
produced. In practice it is usually not advisable to go beyond 45 
the fourth term in the remainder function because the tail 
regions of the general PDF become unstable. 

Using the equations for the derivatives of the standard 
normal PDF (equations E58 and E59), the remainder function 
can be written as a product of the standard normal PDF and a 50 
polynomial whose coefficients depend on the central 
moments of the data distribution. 


R(x) = Z(.y)^F n (x). 
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where the first four terms in the series are 
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Application of the ASP Method to Space Shuttle Main 
Engine Residual Signals: 

In one application use and operation, and in one preferred 
embodiment, the ASP technique was reduced to practice to 
provide highly reliable data validation for high frequency 
Space Shuttle Main Engine (SSME) accelerometers. Each 
SSME is monitored by eight high frequency (10.24 kHz) 
accelerometers with three sensors mounted on each of the two 
high pressure turbopumps and two sensors mounted on the 
engine gimbal bearing. Several MSET models were prepared 
to validate the ASP method forthe six turbopump accelerom- 
eters. The MSET pattern recognition algorithm produces 
extremely accurate simulations of turbopump accelerometer 
behavior. Referring to FIG. f2, some results from applying 
the six sensor MSET model to a typical Space Shuttle flight 
(i.e., flight STS057, engine 1) are shown. For the calculation 
shown in FIG. 12, the MSET training algorithm extracted a 
small fraction of the data in the accelerometer signals to 
simulate the behavior of the accelerometers (the process 
memory matrix contained only 0.0083% of the raw signal 
data). In the top plot, the highly dynamic signal from the first 
SSME turbopump accelerometer in the model (i.e., the oxi- 
dizer prebumer pump 45 accelerometer) is shown as a func- 
tion of time from the launch of the Space Shuttle. In the 
middle plot, MSET’s estimate of the accelerometer signal 
shows the very high accuracy of the MSET simulation. The 
residual signal for the calculation, which is the difference 
between the sensor signal and MSET’s estimate of that signal, 
is shown in the bottom plot. The relative error in the calcula- 
tion, as measured by the ratio of the standard deviation of the 
residual signal to the standard deviation of the sensor signal, 
is only 0.86%. The results shown in FIG. 12 illustrate 
MSET’s ability to accurately model highly dynamic signals, 
even when only a tiny fraction of available data is used to 
represent the signal behavior. 

Still referringto FIG. 12 , the residual signal forthe analysis 
of a typical SSME accelerometer appears to be random. 
Referring to FIG. 13 , the distribution of data in the residual 
signal is shown. The probability density forthe residual signal 
was approximated by dividing the data range into m equally- 
sized bins and then counting the number of residual signal 
data values that fall into each bin. The count number is a 
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discrete function of the sequence {Y m }=y 1 , y 2 , . . . , y m , where 
the ith element in the sequence (yj is given by 


y, = jw> 


,(y. 

+ l 


m 


(E 66) 5 


aid y min is the minimum datum and y max is the maximum 
datum in the residual signal. A normalized residual signal 
PDF is produced by dividing each element in the count num- 
ber function by the total number of elements in the residual 
signal. The number of bins used to produce the residual signal 
PDFs was 1000. As shown in FIG. 13, a PDF for a typical 
residual signal has a nearly Gaussian shape. Superimposed on 
top of residual signal PDF is a Gaussian PDF of the same 15 
mean, standard deviation, and area as the residual signal PDF. 
The first four moments of the residual signal data are as 
follows: mean=l.l*10 -4 g, standard deviation=8.2*10 -2 g, 
skewness=0.32, and kurtosis=5.7, where the skewness and 
kurtosis are related to the third and fourth central moments of 20 
the data by 


skewness = U and kurtosis = (E67) 

cr 3 c r 4 do 

Because the kurtosis of the distribution is greater than 3, the 
residual signal PDF has thicker tail regions than a true Gaus- 
sian distribution. This is confirmed by a visual examination of 30 
FIG. 13. 

The ASP method for near-Gaussian distributions was 
applied to the residual signals from the MSET model of the 
SSME accelerometers. The residual signal PDFs were 
approximated with the series expansion formula of a general 35 
PDF (equations E60 through E65). Four calculations were 
performed for each residual signal in the model. In the first 
calculation, the residual signals were approximated with the 
one-term series expansion formula. In the one-term formula, 
the remainder function (R(x)) is given by the product of the 40 
standard normal PDF and the first term in the series, r^x). 
Subsequent calculations introduced additional terms from the 
series, culminating with the four-term formula in which the 
remainder function is given by the product of the standard 
normal PDF and the four term series, r 1 (x)+r 2 (x)+r 3 (x)+r 4 (x). 45 

For each calculation, the quality of the approximation is 
measured by the root mean-squared (rms) error of the calcu- 
lation. The rms error (E,. ms ) is a function of the difference 
between the calculated PDF (3(x)) from equation E65 and the 
residual signal PDF (F(x)), 50 
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The calculated and residual signal PDFs are discrete func- 
tions of the dimensionless variable x, (y,-p)/ a, where y l is an 
element in the sequence that spans the range of data in the 
residual signal distribution (see equation E66). 60 

In general terms, each additional term in the series expan- 
sion improves the approximation of the residual signal PDFs. 
Referring to FIG. 14 , FIG. 15 and FIG. 16 , adding terms from 
the series expansion formula improves the fit of the residual 
signal PDF from the first sensor. The approximation of the 65 
residual signal PDF generated by the one-term series expan- 
sion formula is compared to a Gaussian approximation and 


the residual signal PDF in FIG. 14 . The one-term approxima- 
tion shows little improvement over the Gaussian approxima- 
tion. The mis error for the Gaussian PDF is 0.414, while the 
mis error for the one-term series expansion PDF is slightly 
smaller at 0.409. The PDF generated by the two-term series 
expansion formula is shown in FIG. 15 . The fit of the two- 
term approximation is significantly better than that of the 
Gaussian approximation. The two-term formula provides an 
excellent fit of the residual signal PDF, especially near the 
peak in the distribution. The greatest source of error in the 
approximation occurs in the transition regions of the distri- 
bution, between 1 and 4 standard deviations from the mean. 
Therms error for the two-term series expansion PDF is 0. 122, 
which is nearly a factor of four smaller than the mis error of 
the Gaussian PDF. The PDF generated by the three-term 
series expansion formula is shown in FIG. 16 . The three-term 
formula provides a better fit of the peak in the distribution 
than does the two-term formula. The mis error for the three- 
term approximation is 0.151, which is slightly larger than that 
of the two-term approximation because its fit of the lower 
transition region is less accurate than that of the two-term 
approximation. 

The mis errors of the four calculations for each of the 
sensors in the model are listed in FIG. 17 . Also included in 
FIG. 17 are the nns errors for Gaussian PDFs of the same 
mean, standard deviation, and area as the residual signal 
PDFs. As indicated by the data in the first two columns of 
FIG. 17 , the one-term series expansion formula produces 
PDFs that are slightly more accurate than the corresponding 
Gaussian. PDFs. Significant improvement is exhibited by the 
two-term series expansion formula, which produces PDFs 
whose mis errors are as much as a factor of four smaller than 
those from the corresponding Gaussian and one-tenn series 
expansion approximations. The three-term series expansion 
formula generally produces larger rms errors than does the 
two-term formula because the fit of the transitions regions of 
the distribution is less accurate. This trend is further exhibited 
by the four-term series expansion formula. For three of the 
calculations (i.e., sensor numbers 1, 2, and 6), the four-term 
approximations are unstable in the transition regions, result- 
ing in rms errors that are much larger than those of the cor- 
responding Gaussian approximations. The four-term series 
expansion formula produces an improved approximation for 
the residual signal from sensor number 5 only. The residual 
signal for this accelerometer is the most nearly Gaussian of 
the sensors to begin with, as indicated by the mis errors for the 
Gaussian approximations for all six sensors. These results 
suggest that the higher order terms in the series expansion 
should be used with caution: only those PDFs that are nearly 
Gaussian to begin with should be approximated with terms 
higher than the second term in the series expansion. 

In the ASP method for near-Gaussian distributions, the 
central moments of the distribution to be approximated are 
used to evaluate the coefficients in the series expansion. A 
second approach, known as the optimized ASP method for 
near-Gaussian distributions, was also evaluated. In the second 
approach, the higher-order central moments (i.e., p 3 , p 4 , p. 5 , 
and p 6 ) are treated as free parameters in the model and the 
approximation is optimized by searching for values of the 
central moments that provide the best fit of the distribution. In 
this approach, the simplex optimization method is used to 
minimize the mis error of the calculation, as described in A 
Simplex Method for Function Minimization, by J. A. Nelder 
and R. Mead, Computer Journal, Vol. 7, 1965, at pages 308 
through 313. The simplex method is particularly useful for 
minimizing complicated transcendental functions of two or 
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more variables because it minimizes a function by utilizing 
evaluations of the function only — no evaluations of function 
derivatives are required. 

n the optimized ASP method for near-Gaussian distribu- 
tions, the first step is to select the approximation formula to be 5 
optimized, either the one, two, three, or four-term series 
expansion formula. The number of free parameters in the 
model is determined by the approximation formula used. For 
instance, the one-term formula requires only one free param- 
eter (i.e., p 3 ), whereas the four-term formula requires four 10 
free parameters (i.e., p 3 , (t 4 , p. 5 , and |i 6 ). The function that is 
minimized by the simplex algorithm is the rms error for the 
approximation, given by equation E68. Hie higher-order cen- 
tral moments of the distribution are used as initial values of 
the free parameters in the model. The simplex algorithm 
iterates on the free parameters until a minimum in the func- 
tion is found, thereby producing the best fit (i.e., smallest rms 
error) between the calculated PDF and the residual signal 
PDF, for a given approximation formula. 

The rms errors of the calculations using the optimized ASP 
method for near-Gaussian distributions are listed in FIG. 18. 20 
Also included in FIG. 18 are the rms errors for Gaussian PDF s 
of the same mean, standard deviation, and area as the residual 
signal PDFs. Comparing the rms errors of the Gaussian 
approximations to those from the optimized calculations 
shows that all optimized series expansion formulas produce 25 
significant reductions in the rms error. Comparing the data in 
FIG. 18 to those of FIG. 17 shows that the optimized ASP 
method produces more accurate approximations than does 
the ASP method for each series expansion formula. For a 
given sensor, the optimized two-term series expansion for- 30 
mula produces rms errors that are roughly a factor of two 
smaller than those of the optimized one-term formula. The 
optimized three-term and four-term series expansion formu- 
las though, produce rms errors that are roughly equivalent to 
those of the optimized two-term formula. Since the three- 
term and four-term models incorporate more complicated 
functions with more free parameters than the two-term 
model, these results indicate that the optimized two-term 
series expansion method provides the best balance between 
accuracy and efficiency. 

The PDF generated by the optimized two-term series 40 
expansion formula for the first sensor is shown in FIG. 19. 
The optimized two-term formula provides an excellent fit of 
the residual signal PDF across all regions of the distribution, 
including the transition regions. Comparing the results of the 
optimized ASP calculation in FIG. 19 to those from corre- 45 
sponding ASP calculations in FIG. 15 and FIG. 16, reveals 
that the main effect of the optimization is to improve the 
accuracy of the approximation through the transition regions 
of the distribution. 

The optimized ASP method for near-Gaussian distribu- 50 
tions produces an excellent fit of the residual signal distribu- 
tions from the MSET model of SSME accelerometer signals. 
Because the optimized ASP method is tuned to the specific 
data distributions analyzed, the empirical error probabilities 
for the method will better adhere to the theoretical limits for 
the fault detection procedure. 

A parametric study of the empirical false alarm rate was 
performed to compare the false alarm of the SPRT method to 
the new ASP method for the SSME residual signals. In the 
study, the residual signals generated by applying the six sen- 
sor MSET model to the accelerometer signals from flight 60 
STS057 engine 1 were analyzed with the SPRT and ASP 
sequential hypothesis tests at various values of the false alarm 
probability and the system disturbance magnitude param- 
eters. Because of the high frequency of the accelerometer 
signals (10.24 kHz) and long duration of the flight (542.4 65 
sec), the residual signals contain over five million data values. 
The residual signal data set was so large that every tenth point 
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was extracted from the signals and the fault detection models 
were applied to the subset. Hie subset of the residual signals 
was large enough to produce accurate statistics but small 
enough to permit multiple calculations in a reasonable time 
span. Typically, the sequential hypothesis tests make a deci- 
sion for every 1 to 10 data points analyzed. Thus for the 
parametric calculations of the SSME residual signals, hun- 
dreds of thousands of decisions were made for each signal. By 
dividing the number of fault decisions from a SPRT analysis 
of a residual signal into the total number of fault and normal 
decisions made by the test, the empirical false alarm rate 
could be accurately evaluated. 

Referring to FIG. 20, the results from the application of the 
positive mean SPRT and ASP tests to the residual signal from 
the first sensor are shown. The calculations shown in FIG. 20 
were performed with a constant preassigned missed alarm 
probability (|f) of 0.01, a constant system disturbance mag- 
nitude (M) of 6, and a variable preassigned false alarm prob- 
ability (a). The solid diagonal line in the figure represents the 
theoretical upper bound for the empirical false alarm prob- 
ability, as defined by equation E39. For the SPRT calculation, 
the empirical false alarm rate satisfies the theoretical upper 
bound only for preassigned false alarm probabilities greater 
than 0.003 . For smaller values of the preassigned false alarm 
probability, the SPRT model reaches more fault decisions 
than one would expect based on theoretical arguments. The 
theoretical upper bound for the empirical false alarm prob- 
ability is not met for all values of the preassigned false alarm 
probability because the residual signals are not purely Gaus- 
sian, as confirmed in FIG. 13. The residual signal distribution 
has more data populating the tail regions than does a Gaussian 
distribution of the same mean, standard deviation, and area as 
the residual signal distribution. The data from the tail regions 
of a distribution tire a major source of false alarms in the SPRT 
calculations, because they are more representative of the 
alternative hypothesis than they are of the normal hypothesis. 
Because the residual signals exhibit more heavily populated 
tail regions than true Gaussian distributions, they have a ten- 
dency to trigger more false alarms than anticipated, especially 
at high levels of sensitivity (i.e., small values of the false 
alami probability). Because the optimized ASP method is 
tuned to the specific data distributions exhibited by the 
residual signals, the empirical false alarm rate for the ASP 
calculation satisfies the theoretical upper bound for all values 
of the preassigned false alarm probability. The ASP method 
produces fewer false alarms than the SPRT method for all 
values of the preassigned false alarm probability. 

ASP Benefits and Applications: 

The Adaptive Sequential Probability (ASP) technique was 
developed as the fault detection element of a software pro- 
gram that reliably detects signal data faults for an asset, such 
as a process and/or apparatus. These signal validation mod- 
ules improve safety, reduce maintenance cost, and enable 
optimal performance for a wide range of aeronautical, indus- 
trial, chemical, power generating, medical, biological, finan- 
cial, and military assets. Signal validation is required in all 
types of process critical control and safety systems where 
unexpected process interruptions due to sensor or control 
component failures or false alarms are unsafe or uneconomi- 
cal. Signal validation assures the safe, reliable operation of a 
process or apparatus and reduces the manpower, schedule and 
uncertainty associated with sensor and component failure 
detection. Signal validation prevents a facility safety or con- 
trol system from making critical decisions, such as the deci- 
sion to shut down a process or abort a mission, on the basis of 
bad sensor data. Signal validation improves process quality 
and efficiency by ensuring that closed loop control or data 
analysis is performed using good data. Finally, signal valida- 
tion increases system up time and decreases system mainte- 
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nance cost by enabling sensor calibration using on-condition 
criteria rather than time-in-service criteria. 

Moreover, having thus described the invention, it should be 
apparent that numerous structural modifications and adapta- 
tions may be resorted to without departing from the scope and 5 
fair meaning of the instant invention as set forth hereinabove 
and as described herein below by the claims. 

We claim: 

1 . A method for performing surveillance an asset, the steps 

including: 10 

acquiring residual data correlative to expected asset opera- 
tion; 

fitting a mathematical model to the acquired residual data 
and storing the mathematical model in a memory means; 

collecting a current set of observed signal data values from 1 5 
the asset; 

using the mathematical model in a sequential hypotheses 
test performed as a function of at least one of the 
observed signal data values for detecting a fault condi- 
tion; and 20 

determining if a detected fault condition is indicative of an 
actual presence of a fault for providing asset surveil- 
lance. 

2. The method of claim 1 wherein the step of acquiring 
residual data includes a step of acquiring residual data cor- 2 s 
relative to at least one mode of asset operation. 

3. The method of claim 2 wherein the step of acquiring 
residual data correlative to at least one mode of asset opera- 
tion includes a step of acquiring past observations of signal 
data values from the asset and obtaining deviations between , () 
the past observations of signal data values and expected signal 
values obtained by using a model of expected asset operation. 

4. The method of claim 1 wherein the step of fitting the 
mathematical model to the acquired residual data and storing 
the mathematical model in the memory means includes a step 

of fitting at least one mathematical function correlative to a 35 
frequency distribution to the acquired residual data. 

5. The method of claim 4 wherein the step of fitting at least 
one mathematical function correlative to a frequency distri- 
bution to the acquired residual data includes a step of fitting at 
least one function defining a frequency curve to the acquired 40 
residual data. 

6. The method of claim 4 wherein the step of fitting at least 
one mathematical function correlative to a frequency distri- 
bution to the acquired residual data includes a step of fitting at 
least one function defining a frequency histogram to the 45 
acquired residual data. 

7. The method of claim 4 wherein the step of fitting at least 
one mathematical function correlative to a frequency distri- 
bution to the acquired residual data includes a step of fitting at 
least one function defining a frequency polygon to the 50 
acquired residual data. 

8. The method of claim 1 wherein the step of using the 
mathematical model in a sequential hypotheses test per- 
formed as a function of at least one of the observed signal data 
values for detecting a fault condition includes the step of . . 
performing at least one of a group of sequential hypothesis 
tests comprised of a positive mean test, a negative mean test, 

a nominal variance test, and a inverse variance test. 

9. The method of claim 1 further including a step of per- 

forming a control action when an actual presence of a fault is 
determined. 60 

10. The method of claim 1 further including a step of 
sounding an alarm when an actual presence of a fault is 
determined. 

11. A method for performing surveillance of an asset, the 
steps including: 
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acquiring residual data correlative to expected asset opera- 
tion; 

fitting a mathematical model to the acquired residual data 
and storing the mathematical model in a memory means; 

collecting a current set of observed signal data values from 
the asset; 

using the mathematical model in a sequential hypotheses 
test performed as a function of at least one of the 
observed signal data values for determining indices cor- 
relative to asset status; 

evaluating the indices for detecting a fault condition; and 

determining if a detected fault condition is indicative of an 
actual presence of a fault for providing asset surveil- 
lance. 

12. The method of claim 11 wherein the step of acquiring 
residual data includes a step of acquiring residual data cor- 
relative to at least one mode of asset operation. 

13. The method of claim 11 wherein the step of fitting the 
mathematical model to the acquired residual data and storing 
the mathematical model in the memory means includes a step 
of fitting at least one mathematical function correlative to a 
frequency distribution to the acquired residual data. 

14. The method of claim 13 wherein the step of fitting at 
least one mathematical function correlative to a frequency 
distribution to the acquired residual data includes a step of 
fitting at least one function defining a frequency curve to the 
acquired residual data. 

15. The method of claim 13 wherein the step of fitting at 
least one mathematical function correlative to a frequency 
distribution to the acquired residual data includes a step of 
fitting at least one function defining a frequency histogram to 
the acquired residual data. 

16. The method of claim 13 wherein the step of fitting at 
least one mathematical function correlative to a frequency 
distribution to the acquired residual data includes a step of 
fitting at least one function defining a frequency polygon to 
the acquired residual data. 

17. The method of claim 1 1 wherein the step of using the 
mathematical model in a sequential hypotheses test per- 
formed as a function of at least one of the observed signal data 
values for detecting a fault condition includes the step of 
performing at least one of a group of sequential hypothesis 
tests comprised of a positive mean test, a negative mean test, 
a nominal variance test, and a inverse variance test. 

18. The method of claim 11 further including a step of 
performing a control action when an actual presence of a fault 
is determined. 

19. The method of claim 11 further including a step of 
sounding an alarm when an actual presence of a fault is 
determined. 

20. A method for performing surveillance of an asset, the 
steps including: 

acquiring residual data correlative to expected asset opera- 
tion; 

fitting a mathematical model to the acquired residual data 
and storing the mathematical model in a memory means ; 

collecting a current set of observed signal data values from 
the asset; 

using the mathematical model in a sequential hypotheses 
test performed as a function of at least one of the 
observed signal data values for determining a time series 
of indices correlative to asset status; 

evaluating the time series of indices for detecting a fault 
condition; and 

determining if a detected fault condition is indicative of an 
actual presence of a fault for providing asset surveil- 
lance. 





